IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant 



Jan Zavada et al . 



Serial No. : 09/967,237 



Group Art Unit: 1642 



Filed 



September 27, 2 001 



Examiner: David J. Blanchard 



For 



MN Gene and Protein 



SUBMISSION OF SUBSTITUTE SEQUENCE LISTING 



Mail Stop Amendment 
Commissioner for Patents 
P.O. Box 1450 
Alexandria, Va 22313-1450 

Sir: 



Applicants submit the enclosed substitute Sequence 



Listing of the nucleotide and amino acid sequences contained in 
the above- identified application. Also enclosed is a computer 
readable copy of the substitute Sequence Listing . The nucleotide 
and amino acid sequences are presented in a form which conforms 
with the requirements of 37 CFR Sections 1.821 through 1.825. 



In accordance with 37 CFR Section 1.821(f), the 



undersigned Attorney for the Applicants hereby states that the 
information recorded in computer readable form is identical to 
that in the printed substitute Sequence Listing . Further, in 
accordance with 37 CFR Section 1.821(g), the undersigned Attorney 
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) 

for the Applicants states that the enclosed substitute Sequence 
Listing includes no new matter. 

Respectfully submitted, 

^Leona L ^ Laufrer 
Attorney for Applicants 
Registration No. 30,863 

Dated: July 22, 2004 
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SEQUENCE LISTING 

< 1 1 0 > Zavada , Jan 

Pastorekova, Silvia 
Pastorek, Jaromir 

<120> MN Gene and Protein 

<130> D-0021.5B-2 

<140> 09/967,237 
<141> 2001-09-27 

<150> 09/178,115 
<151> 1998-10-23 

<160> 116 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1522 
<212> DNA 
<213> HUMAN 

<220> 

<221> CDS 

<222> (13) . . (1389) 

<220> 

<221> mat_peptide 
<222> (124) . . (1389) 

<400> 1 

acagtcagcc gc atg get ccc ctg tgc ccc age ccc tgg etc cct ctg ttg 51 
Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu 
-35 -30 -25 

ate ccg gec cct get cca ggc etc act gtg caa ctg ctg ctg tea ctg 99 
lie Pro Ala Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu 
-20 -15 -10 

ctg ctt ctg atg cct gtc cat ccc cag agg ttg ccc egg atg cag gag 147 
Leu Leu Leu Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu 
-5 -11 5 

gat tec ccc ttg gga gga ggc tct tct ggg gaa gat gac cca ctg ggc 195 
Asp Ser Pro Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly 
10 15 20 

gag gag gat ctg ccc agt gaa gag gat tea ccc aga gag gag gat cca 243 
Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro 
25 30 35 40 

ccc gga gag gag gat eta cct gga gag gag gat eta cct gga gag gag 2 91 
Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu 
45 50 55 



gat eta cct gaa gtt aag cct aaa tea gaa gaa gag ggc tec ctg aag 33 9 
Asp Leu Pro Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys 
60 65 70 



tta gag gat eta cct act gtt gag get . cct gga gat cct caa gaa ccc 3 87 
Leu Glu Asp Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro 
75 80 85 

cag aat aat gee cac agg gac aaa gaa ggg gat gac cag agt cat tgg 4 35 
Gin Asn Asn Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp 
90 95 100 

cgc tat gga ggc gac ccg ccc tgg ccc egg gtg tec cca gee tgc gcg 483 
Arg Tyr Gly Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala 
105 110 115 120 

ggc cgc ttc cag tec ccg gtg gat ate cgc ccc cag etc gee gee ttc 531 
Gly Arg Phe Gin Ser Pro Val Asp lie Arg Pro Gin Leu Ala Ala Phe 
125 130 135 

tgc ccg gee ctg cgc ccc ctg gaa etc ctg ggc ttc cag etc ccg ccg 579 
Cys Pro Ala Leu Arg Pro Leu Glu Leu Leu Gly Phe Gin Leu Pro Pro 
140 145 150 

etc cca gaa ctg cgc ctg cgc aac aat ggc cac agt gtg caa ctg acc 627 
Leu Pro Glu Leu Arg Leu Arg Asn Asn Gly His Ser Val Gin Leu Thr 
155 160 165 

ctg cct cct ggg eta gag atg get ctg ggt ccc ggg egg gag tac egg 675 
Leu Pro Pro Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg 
170 175 180 

get ctg cag ctg cat ctg cac tgg ggg get gca ggt cgt ccg ggc teg 723 
Ala Leu Gin Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser 
185 190 195 200 

gag cac act gtg gaa ggc cac cgt ttc cct gee gag ate cac gtg gtt 771 
Glu His Thr Val Glu Gly His Arg Phe Pro Ala Glu lie His Val Val 
205 210 215 

cac etc age acc gee ttt gee aga gtt gac gag gee ttg ggg cgc ccg 819 
His Leu Ser Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro 
220 225 230 

gga ggc ctg gec gtg ttg gec- gee ttt ctg gag gag ggc ccg gaa gaa 867 
Gly Gly Leu Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu 
235 240 245 

aac agt gee tat gag cag ttg ctg tct cgc ttg gaa gaa ate get gag 915 
Asn Ser Ala Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu lie Ala Glu 
250 255 260 

gaa ggc tea gag act cag gtc cca gga ctg gac ata tct gca etc ctg 963 
Glu Gly Ser Glu Thr Gin Val Pro Gly Leu Asp lie Ser Ala Leu Leu 
265 270 275 280 
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ccc tct gac ttc age cgc tac ttc caa tat gag ggg tct ctg act aca 1011 
Pro Ser Asp Phe Ser Arg Tyr Phe Gin Tyr Glu Gly Ser Leu Thr Thr 
285 290 295 

ccg ccc tgt gec cag ggt gtc ate tgg act gtg ttt aac cag aca gtg 1059 
Pro Pro Cys Ala Gin Gly Val lie Trp Thr Val Phe Asn Gin Thr Val 
300 305 310 

atg ctg agt get aag cag etc cac acc etc tct gac acc ctg tgg gga 1107 
Met Leu Ser Ala Lys Gin Leu His Thr Leu Ser Asp Thr Leu Trp Gly 
315 320 325 

cct ggt gac tct egg eta cag ctg aac ttc cga gcg acg cag cct ttg 1155 
Pro Gly Asp Ser Arg Leu Gin Leu Asn Phe Arg Ala Thr Gin Pro Leu 
330 335 340 

aat ggg cga gtg att gag gee tec ttc cct get gga gtg gac age agt 1203 
Asn Gly Arg Val lie Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser 
345 350 355 360 

cct egg get get gag cca gtc cag ctg aat tec tgc ctg get get ggt 1251 
Pro Arg Ala Ala Glu Pro Val Gin Leu Asn Ser Cys Leu Ala Ala Gly 
365 370 375 

gac ate eta gee ctg gtt ttt ggc etc ctt ttt get gtc acc age gtc 1299 
Asp lie Leu Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val 
380 385 390 

gcg ttc ctt gtg cag atg aga agg cag cac aga agg gga acc aaa ggg 1347 
Ala Phe Leu Val Gin Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly 
395 400 405 

ggt gtg age tac cgc cca gca gag gta gee gag act gga gee 13 8 9 

Gly Val Ser Tyr- Arg Pro Ala Glu Val Ala Glu Thr Gly Ala 
410 415 420 

tagaggctgg atcttggaga atgtgagaag ccagccagag gcatctgagg gggagccggt 144 9 

aactgtcctg tcctgctcat tatgecaett ccttttaact gecaagaaat tttttaaaat 1509 

aaatatttat aat 1522 



<210> 2 
<211> 459 
<212> PRT 
<213> HUMAN 

<400> 2 

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu lie Pro Ala 
-35 -30 -25 

Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu Leu Leu Leu 
-20 -15 -10 

Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro 
-5 -11 5 10 
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Leu Gly 



Gly Gly 
15 



Ser 



Ser 



Gly 



Glu Asp Asp Pro Leu 
20 



Gly Glu 
25 



Glu Asp 



Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu 
30 35 .40 

Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
45 50 55 

Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp 
60 65 70 75 

Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro Gin Asn Asn 
80 85 . 90 

Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp Arg Tyr Gly 
95 100 105 

Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe 
110 115 120 

Gin Ser Pro Val Asp lie Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala 
125 130 135 

Leu Arg Pro Leu Glu Leu Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu 
140 145 150 155 

Leu Arg Leu Arg Asn Asn Gly His Ser Val Gin Leu Thr Leu Pro Pro 
160 165 ' 170 

Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin 
175 180 185 

Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr 
190 195 200 

Val Glu Gly His Arg Phe Pro Ala Glu lie His Val Val His Leu Ser 
205 210 215 

Thr Ala. Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu 
220 225 230 235 

Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala 
240 245 250 

Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu lie Ala Glu Glu Gly Ser 
255 260 265 

Glu Thr Gin Val Pro Gly Leu Asp lie Ser Ala Leu Leu Pro Ser Asp 
270 275 280 

Phe Ser Arg Tyr Phe Gin Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys 
285 290 295 

Ala Gin Gly Val lie Trp Thr Val Phe Asn Gin Thr Val Met Leu Ser 
300 305 310 315 



Ala Lys Gin Leu 



Ser Arg Leu Gin 
335 

Val He Glu Ala 
350 

Ala Glu Pro Val 
365 

Ala Leu Val Phe 
380 

Val Gin Met Arg 



Tyr Arg Pro Ala 
415 



His Thr Leu Ser 
320 

Leu Asn Phe Arg 



Ser Phe Pro Ala 
355 

Gin Leu Asn Ser 
370 

Gly Leu Leu Phe 
385 

Arg Gin His Arg 
400 

Glu Val Ala Glu 



Asp Thr Leu Trp 
325 

Ala Thr Gin Pro 
340 

Gly Val Asp Ser 



Cys Leu Ala Ala 
375 

Ala Val Thr Ser 
390 

Arg Gly Thr Lys 
405 

Thr Gly Ala 
420 



Gly Pro Gly Asp 
330 

Leu Asn Gly Arg 
345 

Ser Pro Arg Ala 
360 

Gly Asp He Leu 



Val Ala Phe Leu 
395 

Gly Gly Val Ser 
410 



<210> 3 

<211> 29 

<212> DNA 

<213> HUMAN 

<400> 3 

cgcccagtgg gtcatcttcc ccagaagag 2 9 



<210> 4 
<211> 19 
<212> DNA 
<213> HUMAN 

<400> 4 

ggaatcctcc tgcatccgg 19 



<210> 5 

<211> 10898 

<212> DNA 

<213> HUMAN 

<220> 

<221> gene 

<222> (1) . . (10898) 

<223> full-length MN genomic sequence 
<220> 

<221> unsure of base at position 1974 
<222> (1974) 

<223> unsure of base at position 1974, which is in the 5' region flanking the 
transcription initiation site (3507) as determined by RNase protection assay. 
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<400> 5 

ggatcctgtt gactcgtgac cttaccccca accctgtgct ctctgaaaca tgagctgtgt 60 
ccactcaggg ttaaatggat taagggcggt gcaagatgtg ctttgttaaa cagatgcttg 120 
aaggcagcat gctcgttaag agtcatcacc aatccctaat ctcaagtaat cagggacaca 180 
aacactgcgg aaggccgcag ggtcctctgc ctaggaaaac cagagacctt tgttcacttg 240 
tttatctgac cttccctcca ctattgtcca tgaccctgcc aaatccccct ctgtgagaaa 300 
cacccaagaa ttatcaataa aaaaataaat ttaaaaaaaa aatacaaaaa aaaaaaaaaa 3 60 
aaaaaaaaaa gacttacgaa tagttattga taaatgaata gctattggta aagccaagta 420 
aatgatcata ttcaaaacca gacggccatc atcacagctc aagtctacct gatttgatct 480 
ctttatcatt gtcattcttt ggattcacta gattagtcat catcctcaaa attctccccc 540 
aagttctaat tacgttccaa acatttaggg gttacatgaa gcttgaacct actaccttct 600 
ttgcttttga gccatgagtt gtaggaatga tgagtttaca ccttacatgc tggggattaa 660 
tttaaacttt acctctaagt cagttgggta gcctttggct tatttttgta gctaattttg 720 
tagttaatgg atgcactgtg aatcttgcta tgatagtttt cctccacact ttgccactag 780 
gggtaggtag gtactcagtt ttcagtaatt gcttacctaa gaccctaagc cctatttctc 840 
ttgtactggc ctttatctgt aatatgggca tatttaatac aatataattt ttggagtttt 900 
tttgtttgtt tgtttgtttg tttttttgag acggagtctt gcatctgtca tgcccaggct 960 
99 a 9tagcag tggtgccatc tcggctcact gcaagctcca cctcccgagt tcacgccatt 1020 
ttcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccaccat gcccggctaa 1080 
ttttttgtat ttttggtaga gacggggttt caccgtgtta gccagaatgg tctcgatctc 1140 
ctgacttcgt gatccacccg cctcggcctc ccaaagttct gggattacag gtgtgagcca 1200 
ccgcacctgg ccaatttttt gagtctttta aagtaaaaat atgtcttgta agctggtaac 1260 
tatggtacat ttccttttat taatgtggtg ctgacggtca tataggttct tttgagtttg 1320 
gcatgcatat gctacttttt gcagtccttt cattacattt ttctctcttc atttgaagag 1380 
catgttatat cttttagctt cacttggctt aaaaggttct ctcattagcc taacacagtg 1440 
tcattgttgg taccacttgg atcataagtg gaaaaacagt caagaaattg cacagtaata 1500 
cttgtttgta agagggatga ttcaggtgaa tctgacacta agaaactccc ctacctgagg 1560 
tctgagattc ctctgacatt gctgtatata ggcttttcct ttgacagcct gtgactgcgg 162 0 
actatttttc ttaagcaaga tatgctaaag ttttgtgagc ctttttccag agagaggtct 1680 
catatctgca tcaagtgaga acatataatg tctgcatgtt tccatatttc aggaatgttt 1740 
gcttgtgttt tatgctttta tatagacagg gaaacttgtt cctcagtgac ccaaaagagg 1800 
tgggaattgt tattggatat catcattggc ccacgctttc tgaccttgga aacaattaag 1860 
ggttcataat ctcaattctg tcagaattgg tacaagaaat agctgctatg tttcttgaca 1920 
ttccacttgg taggaaataa gaatgtgaaa ctcttcagtt ggtgtgtgtc cctngttttt 1980 
ttgcaatttc cttcttactg tgttaaaaaa aagtatgatc ttgctctgag aggtgaggca 2040 
ttcttaatca tgatctttaa agatcaataa tataatcctt tcaaggatta tgtctttatt 2100 
ataataaaga taatttgtct ttaacagaat caataatata atcccttaaa ggattatatc 2160 
tttgctgggc gcagtggctc acacctgtaa tcccagcact ttgggtggcc aaggtggaag 222 0 
gatcaaattt gcctacttct atattatctt ctaaagcaga attcatctct cttccctcaa 2280 
tatgatgata ttgacagggt ttgccctcac tcactagatt gtgagctcct gctcagggca 2340 
ggtagcgttt tttgtttttg tttttgtttt tcttttttga gacagggtct tgctctgtca 2400 
cccaggccag agtgcaatgg tacagtctca gctcactgca gcctcaaccg cctcggctca 2460 
aaccatcatc ccatttcagc ctcctgagta gctgggacta caggcacatg ccattacacc 2520 
tggctaattt ttttgtattt ctagtagaga cagggtttgg ccatgttgcc cgggctggtc 2580 
tcgaactcct ggactcaagc aatccaccca cctcagcctc ccaaaatgag ggaccgtgtc 2640 
ttattcattt ccatgtccct agtccatagc ccagtgctgg acctatggta gtactaaata 2700 
aatatttgtt gaatgcaata gtaaatagca tttcagggag caagaactag attaacaaag 2760 
gtggtaaaag gtttggagaa aaaaataata gtttaatttg gctagagtat gagggagagt 2820 
agtaggagac aagatggaaa ggtctcttgg gcaaggtttt gaaggaagtt ggaagtcaga 288 0 
agtacacaat gtgcatatcg tggcaggcag tggggagcca atgaaggctt ttgagcagga 2 94 0 
gagtaatgtg ttgaaaaata aatataggtt aaacctatca gagcccctct gacacataca 3000 
cttgcttttc attcaagctc aagtttgtct cccacatacc cattacttaa ctcaccctcg 3060 
ggctccccta gcagcctgcc ctacctcttt acctgcttcc tggtggagtc agggatgtat 3120 
acatgagctg ctttccctct cagccagagg acatgggggg ccccagctcc cctgcctttc 3180 
cccttctgtg cctggagctg ggaagcaggc cagggttagc tgaggctggc tggcaagcag 324 0 
ctgggtggtg ccagggagag cctgcatagt gccaggtggt gccttgggtt ccaagctagt 33 00 
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ccatggcccc gataaccttc tgcctgtgca cacacctgcc cctcactcca cccccatcct 3360 
agctttggta tgggggagag ggcacagggc cagacaaacc tgtgagactt tggctccatc 3420 
tctgcaaaag ggcgctctgt gagtcagcct gctcccctcc aggcttgctc ctcccccacc 3480 
cagctctcgt ttccaatgca cgtacagccc gtacacaccg tgtgctggga caccccacag 3 54 0 
tcagccgcat ggctcccctg tgccccagcc cctggctccc tctgttgatc ccggcccctg 3600 
ctccaggcct cactgtgcaa ctgctgctgt cactgctgct tctggtgcct .gtccatcccc 3660 
agaggttgcc ccggatgcag gaggattccc ccttgggagg aggctcttct ggggaagatg 3720 
acccactggg cgaggaggat ctgcccagtg aagaggattc acccagagag gaggatccac 3780 
ccggagagga ggatctacct ggagaggagg atctacctgg agaggaggat ctacctgaag 3 840 
ttaagcctaa atcagaagaa gagggctccc tgaagttaga ggatctacct actgttgagg 3 900 
ctcctggaga tcctcaagaa ccccagaata atgcccacag ggacaaagaa ggtaagtggt 3 960 
catcaatctc caaatccagg ttccaggagg ttcatgactc ccctcccata ccccagccta 4020 
ggctctgttc actcagggaa ggaggggaga ctgtactccc cacagaagcc cttccagagg 4080 
tcccatacca atatccccat ccccactctc ggaggtagaa agggacagat gtggagagaa 4140 
aataaaaagg gtgcaaaagg agagaggtga gctggatgag atgggagaga agggggaggc 42 00 
tggagaagag aaagggatga gaactgcaga tgagagaaaa aatgtgcaga cagaggaaaa 42 60 
aaataggtgg agaaggagag tcagagagtt tgaggggaag agaaaaggaa agcttgggag 4320 
gtgaagtggg taccagagac aagcaagaag agctggtaga agtcatctca tcttaggcta 43 80 
caatgaggaa ttgagaccta ggaagaaggg acacagcagg tagagaaacg tggcttcttg 4440 
actcccaagc caggaatttg gggaaagggg ttggagacca tacaaggcag agggatgagt 4500 
ggggagaaga aagaagggag aaaggaaaga tggtgtactc actcatttgg gactcaggac 4560 
tgaagtgccc actcactttt tttttttttt tttttgagac aaactttcac ttttgttgcc 4620 
caggctggag tgcaatggcg cgatctcggc tcactgcaac ctccacctcc cgggttcaag 4680 
tgattctcct gcctcagcct ctagccaagt agctgcgatt acaggcatgc gccaccacgc 4740 
ccggctaatt tttgtatttt tagtagagac ggggtttcgc catgttggtc aggctggtct 4800 
cgaactcctg atctcaggtg atccaaccac cctggcctcc caaagtgctg ggattatagg 4860 
cgtgagccac agcgcctggc ctgaagcagc cactcacttt tacagaccct aagacaatga 4920 
ttgcaagctg gtaggattgc tgtttggccc acccagctgc ggtgttgagt ttgggtgcgg 4980 
tctcctgtgc tttgcacctg gcccgcttaa ggcatttgtt acccgtaatg ctcctgtaag 5040 
gcatctgcgt ttgtgacatc gttttggtcg ccaggaaggg attggggctc taagcttgag 5100 
cggttcatcc ttttcattta tacaggggat gaccagagtc attggcgcta tggaggtgag 5160 
acacccaccc gctgcacaga cccaatctgg gaacccagct ctgtggatct cccctacagc 5220 
cgtccctgaa cactggtccc gggcgtccca cccgccgccc accgtcccac cccctcacct 52 80 
tttctacccg ggttccctaa gttcctgacc taggcgtcag acttcctcac tatactctcc 5340 
caccccaggc gacccgccct ggccccgggt gtccccagcc tgcgcgggcc gcttccagtc 5400 
cccggtggat atccgccccc agctcgccgc cttctgcccg gccctgcgcc ccctggaact 5460 
cctgggcttc cagctcccgc cgctcccaga actgcgcctg cgcaacaatg gccacagtgg 552 0 
tgagggggtc tccccgccga gacttgggga tggggcgggg cgcagggaag ggaaccgtcg 5580 
cgcagtgcct gcccgggggt tgggctggcc ctaccgggcg gggccggctc acttgcctct 564 0 
ccctacgcag tgcaactgac cctgcctcct gggctagaga tggctctggg tcccgggcgg 5700 
gagtaccggg ctctgcagct gcatctgcac tggggggctg caggtcgtcc gggctcggag 5760 
cacactgtgg aaggccaccg tttccctgcc gaggtgagcg cggactggcc gagaaggggc 5820 
aaaggagcgg ggcggacggg ggccagagac gtggccctct cctaccctcg tgtccttttc 5880 
agatccacgt ggttcacctc agcaccgcct ttgccagagt tgacgaggcc ttggggcgcc 5940 
cgggaggcct ggccgtgttg gccgcctttc tggaggtacc agatcctgga caccccctac 6000 
tccccgcttt cccatcccat gctcctcccg gactctatcg tggagccaga gaccccatcc 6060 
cagcaagctc actcaggccc ctggctgaca aactcattca cgcactgttt gttcatttaa 6120 
cacccactgt gaaccaggca ccagccccca acaaggattc tgaagctgta ggtccttgcc 6180 
tctaaggagc ccacagccag tgggggaggc tgacatgaca gacacatagg aaggacatag 6240 
taaagatggt ggtcacagag gaggtgacac ttaaagcctt cactggtaga aaagaaaagg 63 00 
aggtgttcat tgcagaggaa acagaatgtg caaagactca gaatatggcc tatttaggga 63 60 
atggctacat acaccatgat tagaggaggc ccagtaaagg gaagggatgg tgagatgcct 6420 
gctaggttca ctcactcact tttatttatt tatttatttt tttgacagtc tctctgtcgc 6480 
ccaggctgga gtgcagtggt gtgatcttgg gtcactgcaa cttccgcctc ccgggttcaa 6540 
gggattctcc tgcctcagct tcctgagtag ctggggttac aggtgtgtgc caccatgccc 6600 
agctaatttt tttttgtatt tttagtagac agggtttcac catgttggtc aggctggtct 6660 
caaactcctg gcctcaagtg atccgcctga ctcagcctac caaagtgctg attacaagtg 6720 
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tgagccaccg 
tcagagaaat 
cttaacatta 
gtcaggacct 
caacacaaag 
agactgcaaa 
tctctccctc 
cagcaagagt 
taaaaaaaaa 
agcattctca 
tcagccatgg 
cccggaagaa 
aggtcagttt 
tggagcttca 
gagccagcgc 
ctgcctacag 
ccagcacttt 
gccaacatgg 
gggtgcctgt 
ggcagaagtt 
gactcttgtc 
aaaaaacaag 
ctttttctga 
ttgttggaaa 
ctagaccttt 
gttttgtata 
tctttttttt 
gccaggctgc 
gggattcatt 
atggtacaca 
cctcccttcc 
caggcctctt 
agggcctgca 
gaaactgtat 
tagatcctct 
ccctctgact 
cagggtgtca 
ctggggtgtg 
caggagaaga 
gggaggctga 
agtgtgaccc 
gtatgcggcc 
gagtttgaga 
atttatttat 
cctgaggtgc 
cccacactgt 
gtgactctcg 
aggcctcctt 
tgtctggttt 
attggtggtc 
gcctggctgc 
ccattcagcc 
accccaaccc 
gctaattgat 
cccccctttt 
cgatcatagc 
ctcaaagcac 



tgcccagcca 
gcctccatca 
ggttcataag 
cacctgaaaa 
gtgtatatat 
cgtcagaagg 
tctctccagc 
acatagagtt 
aacaacagca 
gagctgagga 
ccctggatac 
aacagtgcct 
gttggtctgg 
ggtctgaggc 
tcatcttgat 
attgaaaacc 
gggaggccaa 
tgaaacccca 
aatcccagct 
gcagtgagcc 
tcaaaaaaaa 
accaaa.aaat 
gaactgttta 
tcgttctctt 
taggtttctg 
gttatcaata 
tttttttttt 
tctcaaactc 
ttttcttttt 
gagttaagag 
ctcccacctt 
ccagttgctc 
cttagtgaag 
ccctataccc 
tcacaggctc 
tcagccgcta 
tctggactgt 
tgtggacaca 
aagaaatcaa 
ggtgggagaa 
catctctacc 
tagtcccagc 
ctgcagtgag 
aaaagaaatc 
tggttgtgag 
ccactgacct 
gctacagctg 
ccctgctgga 
ccccccagcc 
acagcccgcc 
tggtgagtct 
ccagggctgc 
caatattaga 
tagaatgaag 
tttaaagata 
tcactgcagc 
tgggactgta 



cactcactga 
tagcatgtca 
caaaataaga 
gccaaacaca 
ggtttcctgt 
gcacgggtca 
ttgtcattga 
tgaaataata 
acaacaaaaa 
atgggagagg 
atgcactcat 
atgagcagtt 
ccactaatct 
tggagatggg 
aataaccatg 
aagcaaaaac 
ggcaggtgga 
tctctactaa 
actcgggagg 
gagatcgtgc 
aaaaaaaaaa 
ggtgtttgga 
tctttaataa 
cttagtcact 
ctagactagg 
ttcatattta 
ttttttacat 
ctgaccttgt 
aatttgctct 
tgtagactca 
cccttctctc 
caaagccctg 
aagtggtctc 
tgaagcttta 
agagactcag 
cttccaatat 
gtttaaccag 
gtgggtgcgg 
ggctgggctc 
tggtttgagc 
aaaaaaaccc 
tactcaagga 
ctatgatccc 
aagaggctgg 
ctggcctggg 
ccctagctcc 
aacttccgag 
gtggacagca 
agtagtccct 
tctcacatct 
gcccctcctc 
tcaggaccgc 
gaggcagatc 
cttgagaaat 
gggtctcact 
ctcgaactcc 
ggcatgagcc 



ttctttaatg 
atatgttcat 
aaaaagaata 
gaatcatgaa 
ggggagtatg 
ctgagagcct 
aaaccagtcc 
cataggattt 
gcaacaacca 
actatgggaa 
ctgtcttaca 
gctgtctcgc 
ctgtggccta 
ctccctccag 
aagctgacag 
cgccgggcac 
tcacgaggtc 
aaatacgaaa 
ctgaggcagg 
cactgcactc 
gaaaaccaag 
aattgtcaag 
gcatcaaata 
cttgggtcat 
tagaactctg 
tttacaagtt 
ctttagtaga 
gatccaccag 
gggcttaaac 
gacggtcttt 
cttcctttct 
tacttttttt 
agagttgagt 
agggggtgca 
gtcccaggac 
gaggggtctc 
acagtgatgc 
gggaaagagg 
tgtggcttac 
ccaggagttc 
caacaaaacc 
ggctgaggtg 
accactgcct 
atggggaata 
acccttgttt 
acaccctctc 
cgacgcagcc 
gtcctcgggc 
tatcctccca 
cctttttctc 
ttggtcctga 
ctctgctccc 
atggtgggga 
ctcccagcat 
ctgtttgccc 
taggctcagg 
actgtgcctg 



ccagccacac 
actcttaggt 
ataaataaaa 
ggtgaatgca 
tacggaggca 
agtatcctag 
accaagcttg 
taagagggag 
ttacaatttt 
cccccttcat 
atgtcattcc 
ttggaagaaa 
gttcataaag 
tgcaggaggg 
acacagttac 
ggtggctcac 
aagagatcaa 
aaatagccag 
agaatggcat 
cagcctgggc 
caaaaaccaa 
gtcaagtctg 
ttttaacttt 
tttaaatctc 
cctttgcatt 
attcagatca 
gacagggttt 
cctcggcctc 
ttgtggccca 
cttctttcct 
ttcttcctct 
tgagttaacg 
taccttggct 
atgtagatga 
tggacatatc 
tgactacacc 
tgagtgctaa 
atgtaagatg 
gcctataatc 
aagacaaggc 
aaaaatagcc 
ggaagatcgc 
accatcttta 
caggagctgg 
cctgtcatgc 
tgacaccctg 
tttgaatggg 
tgctgagcca 
tgtgtgtgcc 
tccagtccag 
tgccaggaga 
tctccttttc 
ttcccccatt 
ccctctcgca 
caggctgggg 
caatcctttc 
gccccaaacg 



agcacaaagt 
tcatgatgtt 
gaagtggcat 
gaggtgacac 
gcagtgagtg 
taaagtgggc 
ttggttcgca 
acactgtctc 
atgttccctc 
gttccggcct 
cccaggaggg 
tcgctgagga 
aatcaccctt 
attgaagcat 
ccgcaaacgg 
gcctgtaatc 
gaccatcctg 
gcgtggtggc 
gaacccggga 
aacagagcga 
aatgagacaa 
gagagctaaa 
gtaaatactt 
acttactcta 
tcttgtgtct 
ttttttcttt 
caccatattg 
ccaaagtgct 
gcactttatg 
tctcttcctt 
cttgcttcct 
tcttatggga 
tctgggaggt 
gaccccaaca 
tgcactcctg 
gccctgtgcc 
gcaggtgggc 
agatgagaaa 
ccaccacgtt 
ggggcaacat 
gggcatggtg 
ttgattccag 
ggatacattt 
agggtggagc 
catgaaccca 
tggggacctg 
cgagtgattg 
ggtacagctt 
agtgtctgtc 
ctgaattcct 
ctcctcagca 
tgcagaacag 
gtccccagag 
aaagaatccc 
tgttgtggca 
accttagctt 
gcccttttac 



6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

82.80 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 
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ttggctttta 
cccttggctg 
ggggtggtgg 
aaagcagccc 
ttttgctgtc 
tgaccctttc 
atgcaaatga 
gaaggggaac 
agaggctgga 
actgtcctgt 
aatatttata 
gaatttccta 
tcggcctcct 



ggaagcaaaa 
gcctcttctg 
agtgcactga 
tctctgctct 
accagcgtcg 
ttcaggcaca 
gctgctcctg 
caaagggggt 
tcttggagaa 
cctgctcatt 
ataaaatatg 
ttactgttat 
tccacacatc 



acggtgctta 
gagactgagg 
ggcaggtgtt 
ccatcgcagg 
cgttccttgt 
agcttccccc 
ggccagtttt 
gtgagctacc 
tgtgagaagc 
atgccacttc 
tgttagtcac 
tagcaccaat 
actccaatgt 



tcttacccct 
cactatgggg 
gaggaactct 
tgacatccta 
gcagatgaga 
acccttgtgg 
ctgattagcc 
gcccagcaga 
cagccagagg 
cttttaactg 
ctttgttccc 
ttagtggtaa 
gttgctcc 



tctcgtgtat 
ctgcctgaga 
gcagacccct 
gccctggttt 
aggcagcaca 
agtcacttca 
tttcctgttg 
ggtagccgag 
catctgaggg 
ccaagaaatt 
caaatcagaa 
tgcatttatt 



ccaccctcat 
actcggggca 
cttccttccc 
ttggcctcct 
ggtattacac 
tgcaaagcgc 
tgtacacaca 
actggagcct 
ggagccggta 
ttttaaaata 
ggaggtattt 
ctattacagt 



10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10898 



<210> 6 
<211> 37 
<212> PRT 
<213> HUMAN 

<400> 6 

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu lie Pro Ala 
15 10 15 

Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu Leu Leu Leu 
20 25 30 



Met Pro Val His Pro 
35 



<210> 7 

<211> 25 

<212> DNA 

<213> HUMAN 

<400> 7 

tggggttctt gaggatctcc aggag 25 



<210> 8 

<211> 26 

<212> DNA 

<213> HUMAN 

<400> 8 

ctctaacttc agggagccct cttctt 26 



<210> 9 

<211> 48 

<212> DNA 

<213> HUMAN 

<220> 

<221> primer_bind 
<222> (1) . . (48) 
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<223> anchor primer that anneals to the homopolymeric tail. 



<220> 

<221> inosine 

<222> (36).. (37) (41).. (42) (46) . . (47) 

<223> each of the modif ied_bases at positions (36) , (37) , (41) 
(47) are inosine 



(42), (46) and 



<400> 9 

cuacuacuac uaggccacgc gtcgactagt acgggaaggg aagggaag 



48 



<210> 10 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 10 

Glu Glu Asp Leu Pro Ser 
1 5 



<210> 11 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 11 

Gly Glu Asp Asp Pro Leu 
1 5 



<210> 12 
<211> 21 
<212> PRT 
<213> HUMAN 

<400> 12 

Asn Asn Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp Arg 
15 10 15 



Tyr Gly Gly Asp Pro 
20 



<210> 13 

<211> 16 

<212> PRT 

<213> HUMAN 



<400> 13 

His Pro Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro Leu Gly Gly 
15 10 15 



<210> 14 
<211> 24 
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<212> PRT 
<213> HUMAN 



<400> 14 

Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu 
15 10 15 

Pro Gly Glu Glu Asp Leu Pro Gly 
20 



<210> 15 
<211> 13 
<212> PRT 
<213> HUMAN 



<400> 15 

Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gin 
1 5 10 



<210> 16 
<211> 16 
<212> PRT 
<213> HUMAN 

<400> 16 

Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly Gly Val Ser Tyr Arg 
1 5 10 15 



<210> 17 
<211> 45 
<212> DNA 
<213> HUMAN 



<400> 17 

gtcgctagct ccatgggtca tatgcagagg ttgccccgga tgcag 



<210> 18 
<211> 43 
<212> DNA 
<213> HUMAN 



<400> 18 

gaagatctct tactcgagca ttctccaaga tccagcctct agg 



<210> 19 
<211> 10 
<212> DNA 
<213> HUMAN 

<400> 19 
ctccatctct 
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<210> 20 
<211> 10 
<212> DNA 
<213> HUMAN 

<400> 20 
ccacccccat 



<210> 21 

<400> 21 
000 



<210> 22 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 22 

Leu Glu His His His His His His 
1 5 



<210> 23 
<211> 10 
<212> DNA 
<213> HUMAN 

<220> 

<221> misc_f eature 
<222> (1) . . (10) 

<400> 23 
yyycayyyyy 



<210> 24 

<211> 10 

<212> DNA 

<213> HUMAN 

<300> 

<301> Locker and Buzard, 

<3 03> DNA Sequencing and Mapping 

<304> 1 

<306> 3-11 

<307> 1990 

<400> 24 
tgtgagactt 



<210> 25 
<211> 4 



<212> PRT 
<213> HUMAN 

<220> 

<221> SITE 
<222> (1) . . (4) 

<223> sequence element defined by Suzuki, J. Mol . Biol., 207: 61-84 (1989) 
motif frequently found in gene regulatory proteins. 



<220> 

<221> VARIANTS 
<222> (3) . . (4) 

<223> variants in sequence element defined by Suzuki, J. Mol. Biol., 207: 6 
94 (1989) as motif frequently found in gene regulatory proteins. 

<400> 25 
Ser Pro Xaa Xaa 
1 



<210> 26 
<211> 4 
<212> PRT 
<213> HUMAN 

<220> 

<221> SITE 
<222> (1) . . (4) 

<223> sequence element defined by Suzuki, J. Mol. Biol., 207: 61-84 (1989) 
a motif frequently found in gene regulatory proteins. 

<220> 

<221> VARIANTS - 
<222> (3) . . (4) 

<223> variants in sequence element defined by Suzuki, J. Mol. Biol., 207: 6 
84 (1989) as a motif frequently found in gene regulatory proteins. 

<400> 26 
Thr Pro Xaa Xaa 
1 



<210> 27 
<211> 540 
<212> DNA 
<213> HUMAN 

<220> 

<221> promoter 
<222> (1) . . (540) 

<400> 27 

cttgcttttc attcaagctc aagtttgtct cccacatacc cattacttaa ctcaccctcg 60 
ggctccccta gcagcctgcc ctacctcttt acctgcttcc tggtggagtc agggatgtat 12 0 
acatgagctg ctttccctct cagccagagg acatgggggg ccccagctcc cctgcctttc 180 
cccttctgtg cctggagctg ggaagcaggc cagggttagc tgaggctggc tggcaagcag 24 0 



13 



ctgggtggtg ccagggagag cctgcatagt 
ccatggcccc gataaccttc tgcctgtgca 
agctttggta tgggggagag ggcacagggc 
tctgcaaaag ggcgctctgt gagtcagcct 
cagctctcgt ttccaatgca cgtacagccc 



gccaggtggt gccttgggtt ccaagctagt 3 00 
cacacctgcc cctcactcca cccccatcct 360 
cagacaaacc tgtgagactt tggctccatc 420 
gctcccctcc aggcttgctc ctcccccacc 480 
gtacacaccg tgtgctggga caccccacag 540 



<210> 28 

<211> 445 

<212> DNA 

<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 1st MN exon 



<400> 28 

gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 60 
agcccctggc tccctctgtt gatcccggcc cctgctccag gcctcactgt gcaactgctg 120 
ctgtcactgc tgcttctggt gcctgtccat ccccagaggt tgccccggat gcaggaggat 180 
tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 240 
agtgaagagg attcacccag agaggaggat ccacccggag aggaggatct acctggagag 3 00 
gaggatctac ctggagagga ggatctacct gaagttaagc ctaaatcaga agaagagggc 3 60 
tccctgaagt tagaggatct acctactgtt gaggctcctg gagatcctca agaaccccag 420 
aataatgccc acagggacaa agaag 445 



<210> 29 

<211> 30 

<212> DNA 

<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<22 3> 2nd MN exon 

<400> 29 

gggatgacca gagtcattgg cgctatggag 3 0 



<210> 30 
<211> 171 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 3rd MN exon 

<400> 30 

gcgacccgcc ctggccccgg gtgtccccag cctgcgcggg ccgcttccag tccccggtgg 60 

atatccgccc ccagctcgcc gccttctgcc cggccctgcg ccccctggaa ctcctgggct 120 

tccagctccc gccgctccca gaactgcgcc tgcgcaacaa tggccacagt g 171 
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<210> 31 
<211> 143 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 4th MN exon 

<400> 31 

tgcaactgac cctgcctcct gggctagaga 
ctctgcagct gcatctgcac tggggggctg 
aaggccaccg tttccctgcc gag 



tggctctggg tcccgggcgg gagtaccggg 60 
caggtcgtcc gggctcggag cacactgtgg 12 0 

143 



<210> 32 
<211> 93 
<212> DNA 
<213> HUMAN 

<220> 

< 2 2 1 > exon 

<222> (1) 

<223> 5th MN exon 

<400> 32 

atccacgtgg ttcacctcag caccgccttt gccagagttg acgaggcctt ggggcgcccg 60 
ggaggcctgg ccgtgttggc cgcctttctg gag 93 



<210> 33 

<211> 67 

<212> DNA 

<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 6th MN exon 
<400> 33 

gagggcccgg aagaaaacag tgcctatgag cagttgctgt ctcgcttgga agaaatcgct 60 
gaggaag 67 



<210> 34 
<211> 158 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 7th MN exon 
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<400> 34 

gctcagagac tcaggtccca ggactggaca 
gctacttcca atatgagggg tctctgacta 
ctgtgtttaa ccagacagtg atgctgagtg 



tatctgcact cctgccctct gacttcagcc 60 
caccgccctg tgcccagggt gtcatctgga 12 0 
ctaagcag 15 8 



<210> 35 
<211> 145 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 8th MN exon 
<400> 35 

ctccacaccc tctctgacac cctgtgggga 
cgagcgacgc agcctttgaa tgggcgagtg 
agcagtcctc gggctgctga gccag 



cctggtgact ctcggctaca gctgaacttc 60 
attgaggcct ccttccctgc tggagtggac 12 0 

145 



<210> 36 
<211> 27 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 9th MN exon 
<400> 36 

tccagctgaa ttcctgcctg gctgctg 27 



<210> 37 
<211> 82 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 10th MN exon 
<400> 37 

gtgacatcct agccctggtt tttggcctcc 
tgcagatgag aaggcagcac ag 



tttttgctgt caccagcgtc gcgttccttg 60 

82 



<210> 38 

<211> 191 

<212> DNA 

<213> HUMAN 

<220> 
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<221> exon 
<222> (1) 
<223> 11th 



MN exon 



<400> 38 

aaggggaacc aaagggggtg tgagctaccg cccagcagag gtagccgaga ctggagccta 60 
gaggctggat cttggagaat gtgagaagcc agccagaggc atctgagggg gagccggtaa 12 0 
ctgtcctgtc ctgctcatta tgccacttcc ttttaactgc caagaaattt tttaaaataa 180 
atatttataa t 191 



<210> 39 
<211> 1174 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (1174) 
<223> 1st MN intron 

<400> 39 

gtaagtggtc atcaatctcc aaatccaggt tccaggaggt tcatgactcc cctcccatac 60 
cccagcctag gctctgttca ctcagggaag gaggggagac tgtactcccc acagaagccc 12 0 
ttccagaggt cccataccaa tatccccatc cccactctcg gaggtagaaa gggacagatg 180 
tggagagaaa ataaaaaggg tgcaaaagga gagaggtgag ctggatgaga tgggagagaa 24 0 
99999 a 99 ct ggagaagaga aagggatgag aactgcagat gagagaaaaa atgtgcagac 300 
agaggaaaaa aataggtgga gaaggagagt cagagagttt gaggggaaga gaaaaggaaa 360 
gcttgggagg tgaagtgggt accagagaca agcaagaaga gctggtagaa gtcatctcat 42 0 
cttaggctac aatgaggaat tgagacctag gaagaaggga cacagcaggt agagaaacgt 4 80 
ggcttcttga ctcccaagcc aggaatttgg ggaaaggggt tggagaccat acaaggcaga 54 0 
999 at 9 a 9t9 gg9 a 9 aa 9 aa agaagggaga aaggaaagat ggtgtactca ctcatttggg 600 
actcaggact gaagtgccca ctcacttttt tttttttttt ttttgagaca aactttcact 660 
tttgttgccc aggctggagt gcaatggcgc gatctcggct cactgcaacc tccacctccc 720 
gggttcaagt gattctcctg cctcagcctc tagccaagta gctgcgatta caggcatgcg 780 
ccaccacgcc cggctaattt ttgtattttt agtagagacg gggtttcgcc atgttggtca 84 0 
ggctggtctc gaactcctga tctcaggtga tccaaccacc ctggcctccc aaagtgctgg 900 
gattataggc gtgagccaca gcgcctggcc tgaagcagcc actcactttt acagacccta 960 
agacaatgat tgcaagctgg taggattgct gtttggccca cccagctgcg gtgttgagtt 1020 
tgggtgcggt ctcctgtgct ttgcacctgg cccgcttaag gcatttgtta cccgtaatgc 1080 
tcctgtaagg catctgcgtt tgtgacatcg ttttggtcgc caggaaggga ttggggctct 1140 
aagcttgagc ggttcatcct tttcatttat acag 1174 



<210> 40 
<211> 193 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (193) 
<223> 2nd MN intron 



<400> 40 

gtgagacacc cacccgctgc acagacccaa 
acagccgtcc ctgaacactg gtcccgggcg 



tctgggaacc cagctctgtg gatctcccct 60 
tcccacccgc cgcccaccgt cccaccccct 120 
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caccttttct acccgggttc cctaagttcc tgacctaggc gtcagacttc ctcactatac 180 
tctcccaccc cag 193 



<210> 41 
<211> 131 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (131) 
<223> 3rd MN intron 

<400> 41 

gtgagggggt ctccccgccg agacttgggg 
gcgcagtgcc tgcccggggg ttgggctggc 
tccctacgca g 



atggggcggg gcgcagggaa gggaaccgtc 60 
cctaccgggc ggggccggct cacttgcctc 12 0 

131 



<210> 42 
<211> 89 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (89) 
<223> 4th MN intron 

<400> 42 

gtgagcgcgg actggccgag aaggggcaaa 
gccctctcct accctcgtgt ccttttcag 



ggagcggggc ggacgggggc cagagacgtg 60 

89 



<210> 43 

<211> 1400 

<212> DNA 

<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (1400) 
<223> 5th MN intron 



cctactcccc gctttcccat cccatgctcc tcccggactc 60 
catcccagca agctcactca ggcccctggc tgacaaactc 120 
tttaacaccc actgtgaacc aggcaccagc ccccaacaag 180 
ttgcctctaa ggagcccaca gccagtgggg gaggctgaca 24 0 
catagtaaag atggtggtca cagaggaggt gacacttaaa 300 
aaaggaggtg ttcattgcag aggaaacaga atgtgcaaag 3 60 
agggaatggc tacatacacc atgattagag gaggcccagt 420 
tgcctgctag gttcactcac tcacttttat ttatttattt 480 
gtcgcccagg ctggagtgca gtggtgtgat cttgggtcac 54 0 
ttcaagggat tctcctgcct cagcttcctg agtagctggg 600 
tgcccagcta attttttttt gtatttttag tagacagggt 660 



<400> 43 

gtaccagatc ctggacaccc 
tatcgtggag ccagagaccc 
attcacgcac tgtttgttca 
gattctgaag ctgtaggtcc 
tgacagacac ataggaagga 
gccttcactg gtagaaaaga 
actcagaata tggcctattt 
aaagggaagg gatggtgaga 
atttttttga cagtctctct 
tgcaacttcc gcctcccggg 
gttacaggtg tgtgccacca 
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ttcaccatgt tggtcaggct ggtctcaaac tcctggcctc aagtgatccg cctgactcag 72 0 
cctaccaaag tgctgattac aagtgtgagc caccgtgccc agccacactc actgattctt 780 
taatgccagc cacacagcac aaagttcaga gaaatgcctc catcatagca tgtcaatatg 840 
ttcatactct taggttcatg atgttcttaa cattaggttc ataagcaaaa taagaaaaaa 900 
gaataataaa taaaagaagt ggcatgtcag gacctcacct gaaaagccaa acacagaatc 960 
atgaaggtga atgcagaggt gacaccaaca caaaggtgta tatatggttt cctgtgggga 102 0 
gtatgtacgg aggcagcagt gagtgagact gcaaacgtca gaagggcacg ggtcactgag 1080 
agcctagtat cctagtaaag tgggctctct ccctctctct ccagcttgtc attgaaaacc 1140 
agtccaccaa gcttgttggt tcgcacagca agagtacata gagtttgaaa taatacatag 1200 
gattttaaga gggagacact gtctctaaaa aaaaaaacaa cagcaacaac aaaaagcaac 12 60 
aaccattaca attttatgtt ccctcagcat tctcagagct gaggaatggg agaggactat 1320 
gggaaccccc ttcatgttcc ggccttcagc catggccctg gatacatgca ctcatctgtc 1380 
ttacaatgtc attcccccag - 1400 



<210> 44 
<211> 1334 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (1334) 
<223> 6th MN intron 

<400> 44 

gtcagtttgt tggtctggcc actaatctct gtggcctagt tcataaagaa tcaccctttg 60 
gagcttcagg tctgaggctg gagatgggct ccctccagtg caggagggat tgaagcatga 12 0 
gccagcgctc atcttgataa taaccatgaa gctgacagac acagttaccc gcaaacggct 180 
gcctacagat tgaaaaccaa gcaaaaaccg ccgggcacgg tggctcacgc ctgtaatccc 24 0 
agcactttgg gaggccaagg caggtggatc acgaggtcaa gagatcaaga ccatcctggc 300 
caacatggtg aaaccccatc tctactaaaa atacgaaaaa atagccaggc gtggtggcgg 360 
gtgcctgtaa tcccagctac tcgggaggct gaggcaggag aatggcatga acccgggagg 42 0 
cagaagttgc agtgagccga gatcgtgcca ctgcactcca gcctgggcaa cagagcgaga 480 
ctcttgtctc aaaaaaaaaa aaaaaaaaga aaaccaagca aaaaccaaaa tgagacaaaa 54 0 
aaaacaagac caaaaaatgg tgtttggaaa ttgtcaaggt caagtctgga gagctaaact 600 
ttttctgaga actgtttatc tttaataagc atcaaatatt ttaactttgt aaatactttt 660 
gttggaaatc gttctcttct tagtcactct tgggtcattt taaatctcac ttactctact 72 0 
agacctttta ggtttctgct agactaggta gaactctgcc tttgcatttc ttgtgtctgt 780 
tttgtatagt tatcaatatt catatttatt tacaagttat tcagatcatt ttttcttttc 840 
tttttttttt tttttttttt ttttacatct ttagtagaga cagggtttca ccatattggc 900 
caggctgctc tcaaactcct gaccttgtga tccaccagcc tcggcctccc aaagtgctgg 960 
gattcatttt ttctttttaa tttgctctgg gcttaaactt gtggcccagc actttatgat 1020 
ggtacacaga gttaagagtg tagactcaga cggtctttct tctttccttc tcttccttcc 1080 
tcccttccct cccaccttcc cttctctcct tcctttcttt cttcctctct tgcttcctca 1140 
ggcctcttcc agttgctcca aagccctgta cttttttttg agttaacgtc ttatgggaag 1200 
ggcctgcact tagtgaagaa gtggtctcag agttgagtta ccttggcttc tgggaggtga 12 60 
aactgtatcc ctataccctg aagctttaag ggggtgcaat gtagatgaga ccccaacata 1320 
gatcctcttc acag 1334 



<210> 45 
<211> 512 
<212> DNA 
<213> HUMAN 

<220> 
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<221> intron 
<222> (1) . . (512) 
<223> 7th MN intron 



<400> 45 

gtgggcctgg ggtgtgtgtg 
gagaaacagg agaagaaaga 
cacgttggga ggctgaggtg 
caacatagtg tgaccccatc 
atggtggtat gcggcctagt 
ttccaggagt ttgagactgc 
acatttattt atttataaaa 
tggagccctg aggtgctggt 
aacccaccca cactgtccac 



gacacagtgg gtgcggggga 
aatcaaggct gggctctgtg 
ggagaatggt ttgagcccag 
tctaccaaaa aaaccccaac 
cccagctact caaggaggct 
agtgagctat gatcccacca 
gaaatcaaga ggctggatgg 
tgtgagctgg cctgggaccc 
tgacctccct ag 



aagaggatgt aagatgagat 60 
gcttacgcct ataatcccac 120 
gagttcaaga caaggcgggg 180 
aaaaccaaaa atagccgggc 24 0 
gaggtgggaa gatcgcttga 3 00 
ctgcctacca tctttaggat 360 
ggaatacagg agctggaggg 42 0 
ttgtttcctg tcatgccatg 480 

512 



<210> 46 
<211> 114 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (114) 
<223> 8th MN intron 

<400> 46 

gtacagcttt gtctggtttc cccccagcca gtagtccctt atcctcccat gtgtgtgcca 60 
gtgtctgtca ttggtggtca cagcccgcct ctcacatctc ctttttctct ccag 114 



<210> 47 

<211> 617 

<212> DNA 

<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (617) 
<223> 9th MN intron 



<400> 47 

gtgagtctgc ccctcctctt ggtcctgatg ccaggagact cctcagcacc attcagcccc 60 
agggctgctc aggaccgcct ctgctccctc tccttttctg cagaacagac cccaacccca 120 
atattagaga ggcagatcat ggtggggatt cccccattgt ccccagaggc taattgatta 180 
gaatgaagct tgagaaatct cccagcatcc ctctcgcaaa agaatccccc cccctttttt 240 
taaagatagg gtctcactct gtttgcccca ggctggggtg ttgtggcacg atcatagctc 3 00 
actgcagcct cgaactccta ggctcaggca atcctttcac cttagcttct caaagcactg 360 
ggactgtagg catgagccac tgtgcctggc cccaaacggc ccttttactt ggcttttagg 420 
aagcaaaaac ggtgcttatc ttaccccttc tcgtgtatcc accctcatcc cttggctggc 480 
ctcttctgga gactgaggca ctatggggct gcctgagaac tcggggcagg ggtggtggag 540 
tgcactgagg caggtgttga ggaactctgc agacccctct tccttcccaa agcagccctc 600 
tctgctctcc atcgcag 617 



<210> 48 
<211> 130 
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<212> DNA 
<213> HUMAN 



<220> 

<221> intron 

<222> (1) . . (130) 

<223> 10th MN intron 



<400> 48 

gtattacact gaccctttct tcaggcacaa 
gcaaagcgca tgcaaatgag ctgctcctgg 
gtacacacag 



gcttccccca cccttgtgga gtcacttcat 60 
gccagttttc tgattagcct ttcctgttgt 120 

130 



<210> 49 

<211> 1401 

<212> DNA 

<213> HUMAN 



<400> 49 

caaactttca 

cctccacctc 

tacaggcatg 

ccatgttggt 

ccaaagtgct 

ttacagaccc 

cggtgttgag 

tacccgtaat 

gattggggct 

cattggcgct 

tctgtggatc 

caccgtccca 

gacttcctca 

ctgcgcgggc 

ggccctgcgc 

gcgcaacaat 

gcgcagggaa 

ggggccggct 

atggctctgg 

gcaggtcgtc 

gcggactggc 

tcctaccctc 

ttgacgaggc 

cagatcctgg 



cttttgttgc 
ccgggttcaa 
cgccaccacg 
caggctggtc 
gggattatag 
taagacaatg 
tttgggtgcg 
gctcctgtaa 
ctaagcttga 
atggaggtga 
tcccctacag 
ccccctcacc 
ctatactctc 
cgcttccagt 
cccctggaac 
ggccacagtg 
gggaaccgtc 
cacttgcctc 
gtcccgggcg 
cgggctcgga 
cgagaagggg 
gtgtcctttt 
cttggggcgc 
acacccccta 



ccaggctgga 
gtgattctcc 
cccggctaat 
tcgaactcct 
gcgtgagcca 
attgcaagct 
gtctcctgtg 
ggcatctgcg 
gcggttcatc 
gacacccacc 
ccgtccctga 
ttttctaccc 
ccaccccagg 
ccccggtgga 
tcctgggctt 
gtgagggggt 
gcgcagtgcc 
tccctacgca 
ggagtaccgg 
gcacactgtg 
caaaggagcg 
cagatccacg 
ccgggaggcc 
c 



gtgcaatggc 
tgcctcagcc 
ttttgtattt 
gatctcaggt 
cagcgcctgg 
ggtaggattg 
ctttgcacct 
tttgtgacat 
cttttcattt 
cgctgcacag 
acactggtcc 
gggttcccta 
cgacccgccc 
tatccgcccc 
ccagctcccg 
ctccccgccg 
tgcccggggg 
gtgcaactga 
gctctgcagc 
gaaggccacc 
gggcggacgg 
tggttcacct 
tggccgtgtt 



gcgatctcgg 
tctagccaag 
ttagtagaga 
gatccaacca 
cctgaagcag 
ctgtttggcc 
ggcccgctta 
cgttttggtc 
atacagggga 
acccaatctg 
cgggcgtccc 
agttcctgac 
tggccccggg 
cagctcgccg 
ccgctcccag 
agacttgggg 
ttgggctggc 
ccctgcctcc 
tgcatctgca 
gtttccctgc 
gggccagaga 
cagcaccgcc 
ggccgccttt 



ctcactgcaa 
tagctgcgat 
cggggtttcg 
ccctggcctc 
ccactcactt 
cacccagctg 
aggcatttgt 
gccaggaagg 
tgaccagagt 
ggaacccagc 
acccgccgcc 
ctaggcgtca 
tgtccccagc 
ccttctgccc 
aactgcgcct 
atggggcggg 
cctaccgggc 
tgggctagag 
ctggggggct 
cgaggtgagc 
cgtggccctc 
tttgccagag 
ctggaggtac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1401 



<210> 50 

<211> 59 

<212> PRT 

<213> HUMAN 



<400> 50 

Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp Leu Pro Ser Glu 
15 10 15 

Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu Pro 
20 25 30 
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Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Glu Val Lys Pro 
35 40 45 



Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu 
50 55 



<210> 51 
<211> 257 
<212> PRT 
<213> HUMAN 

<400> 51 

Gly Asp Asp Gin Ser His Trp Arg Tyr Gly Gly Asp Pro Pro Trp Pro 
15 10 15 

Arg Val Ser Pro Ala Cys Ala Gly Arg Phe Gin Ser Pro Val Asp lie 
20 25 30 

Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala Leu Arg Pro Leu Glu Leu 
35 40 45 

Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu Leu Arg Leu Arg Asn Asn 
50 55 60 

Gly His Ser Val Gin Leu Thr Leu Pro Pro Gly Leu Glu Met Ala Leu 
65 70 75 80 

Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin Leu His Leu His Trp Gly 
85 90 95 

Ala Ala Gly Arg Pro Gly Ser Glu His Thr Val Glu Gly His Arg Phe 
100 105 110 

Pro Ala Glu lie His Val Val His Leu Ser Thr Ala Phe Ala Arg Val 
115 120 125 

Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu Ala Val Leu Ala Ala Phe 
130 135 140 

Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gin Leu Leu Ser 
145 150 155 160 

Arg Leu Glu Glu He Ala Glu Glu Gly Ser Glu Thr Gin Val Pro Gly 
165 170 175 

Leu Asp He Ser Ala Leu Leu Pro Ser Asp Phe Ser Arg Tyr Phe Gin 
180 185 190 

Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys Ala Gin Gly Val He Trp 
195 200 205 

Thr Val Phe Asn Gin Thr Val Met Leu Ser Ala Lys Gin Leu His Thr 
210 215 220 

Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp Ser Arg Leu Gin Leu Asn 
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225 230 235 240 

Phe Arg Ala Thr Gin Pro Leu Asn Gly Arg Val lie Glu Ala Ser Phe 
245 250 255 



Pro 



<210> 52 
<211> 20 
<212> PRT 
<213> HUMAN 

<400> 52 

lie Leu Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala 
15 10 15 

Phe Leu Val Gin 
20 



<210> 53 
<211> 25 
<212> PRT 
<213> HUMAN 



<400> 53 

Met Arg Arg Gin His Arg Arg Gly 
1 5 

Pro Ala Glu Val Ala Glu Thr Gly 
20 



Thr Lys Gly Gly Val Ser Tyr Arg 
10 15 

Ala 
25 



<210> 54 
<211> 59 
<212> PRT 
<213> HUMAN 

<400> 54 
Ser Ala Ser Glu 
1 

Glu Pro Ser Pro 
20 

Ser Val Val Leu 
35 

Ser Pro Ser Glu 
50 



Glu Pro Ser Pro 
5 

Ser Glu Glu Pro 

Phe Pro Ser Glu 
40 

Glu Pro Ser Ala 
55 



Ser Glu Val Pro 
10 

Phe Pro Ser Val 
25 

Glu Pro Phe Pro 
Ser Glu Glu 



Phe Pro Ser Glu 
15 

Arg Pro Phe Pro 
30 

Ser Lys Glu Pro 
45 



<210> 55 
<211> 470 
<212> RNA 
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<213> HUMAN 



<400> 55 

cauggccccg auaaccuucu gccugugcac 
gcuuugguau gggggagagg gcacagggcc 
cugcaaaagg gcgcucugug agucagccug 
agcucucguu uccaaugcac guacagcccg 
cagccgcaug gcuccccugu gccccagccc 
uccaggccuc acugugcaac ugcugcuguc 
gagguugccc cggaugcagg aggauucccc 
cccacugggc gaggaggauc ugcccaguga 



acaccugccc cucacuccac ccccauccua 60 
agacaaaccu gugagacuuu ggcuccaucu 12 0 
cuccccucca ggcuugcucc ucccccaccc 180 
uacacaccgu gugcugggac accccacagu 24 0 
cuggcucccu cuguugaucc cggccccugc 3 00 
acugcugcuu cuggugccug uccaucccca 3 60 
cuugggagga ggcucuucug gggaagauga 42 0 
agaggauuca cccagagagg 470 



<210> .56 
<211> 292 
<212> DNA 
<213> HUMAN 



<400> 56 

gtttttttga gacggagtct tgcatctgtc atgcccaggc tggagtagca gtggtgccat 60 
ctcggctcac tgcaagctcc acctcccgag ttcacgccat tttcctgcct cagcctcccg 120 
agtagctggg actacaggcg cccgccacca tgcccggcta attttttgta tttttggtag 180 
agacggggtt tcaccgtgtt agccagaatg gtctcgatct cctgacttcg tgatccaccc 240 
gcctcggcct cccaaagttc tgggattaca ggtgtgagcc accgcacctg gc 2 92 



<210> 57 

<211> 262 

<212> DNA 

<213> HUMAN 



<400> 57 

tttctttttt gagacagggt cttgctctgt cacccaggcc agagtgcaat ggtacagtct 60 
cagctcactg cagcctcaac cgcctcggct caaaccatca tcccatttca gcctcctgag 120 
tagctgggac tacaggcaca tgccattaca cctggctaat ttttttgtat ttctagtaga 180 
gacagggttt ggccatgttg cccgggctgg tctcgaactc ctggactcaa gcaatccacc 240 
cacctcagcc tcccaaaatg ag 262 



<210> 58 
<211> 2501 
<212> DNA 
<213> HUMAN 

<220> 

<221> misc_f eature 
<222> (1) . . (2501) 

<223> region 5* to transcription initiation site as determined by RNase 
protection assay (nucleotide 3507 of Figures 2A-2F and of SEQ ID NO: 5) , 
corresponding to region of SEQ ID NO: 5 and Figures 2A-2F from nucleotide ( 
to nucleotide (2507) , in which region some regulatory elements are probably 
situated . 

<220> 

<221> unsure what base is at position 1968 
<222> (1968) 
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<223> unsure of base at position 1968, which is the same unknown base as that 
at position 1974 of SEQ ID NO. 5, i.e., the full-length MN genomic sequence, 
and of that unknown at position 1968 of SEQ ID NO: 90, and unknown at position 
647 of SEQ ID NO: 110. That unknown base is in the 5' region flanking the 
transcription initiation site (3507) as determined by RNase protection assay. 

<400> 58 

tgttgactcg tgaccttacc cccaaccctg tgctctctga aacatgagct gtgtccactc 60 
agggttaaat ggattaaggg cggtgcaaga tgtgctttgt taaacagatg cttgaaggca 12 0 
gcatgctcgt taagagtcat caccaatccc taatctcaag taatcaggga cacaaacact 180 
gcggaaggcc gcagggtcct ctgcctagga aaaccagaga cctttgttca cttgtttatc 24 0 
tgaccttccc tccactattg tccatgaccc tgccaaatcc ccctctgtga gaaacaccca 300 
agaattatca ataaaaaaat aaatttaaaa aaaaaataca aaaaaaaaaa aaaaaaaaaa 360 
aaaagactta cgaatagtta ttgataaatg aatagctatt ggtaaagcca agtaaatgat 42 0 
catattcaaa accagacggc catcatcaca gctcaagtct acctgatttg atctctttat 480 
cattgtcatt ctttggattc actagattag tcatcatcct caaaattctc ccccaagttc 540 
taattacgtt ccaaacattt aggggttaca tgaagcttga acctactacc ttctttgctt 600 
ttgagccatg agttgtagga atgatgagtt tacaccttac atgctgggga ttaatttaaa 660 
ctttacctct aagtcagttg ggtagccttt ggcttatttt tgtagctaat tttgtagtta 720 
atggatgcac tgtgaatctt gctatgatag ttttcctcca cactttgcca ctaggggtag 780 
gtaggtactc agttttcagt aattgcttac ctaagaccct aagccctatt tctcttgtac 840 
tggcctttat ctgtaatatg ggcatattta atacaatata atttttggag tttttt£gtt 900 
tgtttgtttg tttgtttttt tgagacggag tcttgcatct gtcatgccca ggctggagta 960 
gcagtggtgc catctcggct cactgcaagc tccacctccc gagttcacgc cattttcctg 1020 
cctcagcctc ccgagtagct gggactacag gcgcccgcca ccatgcccgg ctaatttttt 1080 
gtatttttgg tagagacggg gtttcaccgt gttagccaga atggtctcga tctcctgact 1140 
tcgtgatcca cccgcctcgg cctcccaaag ttctgggatt acaggtgtga gccaccgcac 1200 
ctggccaatt ttttgagtct tttaaagtaa aaatatgtct tgtaagctgg taactatggt 1260 
acatttcctt ttattaatgt ggtgctgacg gtcatatagg ttcttttgag tttggcatgc 1320 
atatgctact ttttgcagtc ctttcattac atttttctct cttcatttga agagcatgtt 1380 
atatctttta gcttcacttg gcttaaaagg ttctctcatt agcctaacac agtgtcattg 1440 
ttggtaccac ttggatcata agtggaaaaa cagtcaagaa attgcacagt aatacttgtt 1500 
tgtaagaggg atgattcagg tgaatctgac actaagaaac tcccctacct gaggtctgag 1560 
attcctctga cattgctgta tataggcttt tcctttgaca gcctgtgact gcggactatt 1620 
tttcttaagc aagatatgct aaagttttgt gagccttttt ccagagagag gtctcatatc 1680 
tgcatcaagt gagaacatat aatgtctgca tgtttccata tttcaggaat gtttgcttgt 1740 
gttttatgct tttatataga cagggaaact tgttcctcag tgacccaaaa gaggtgggaa 1800 
ttgttattgg atatcatcat tggcccacgc tttctgacct tggaaacaat taagggttca 1860 
taatctcaat tctgtcagaa ttggtacaag aaatagctgc tatgtttctt gacattccac 1920 
ttggtaggaa ataagaatgt gaaactcttc agttggtgtg tgtccctngt ttttttgcaa 1980 
tttccttctt actgtgttaa aaaaaagtat gatcttgctc tgagaggtga ggcattctta 2 040 
atcatgatct ttaaagatca ataatataat cctttcaagg attatgtctt tattataata 2100 
aagataattt gtctttaaca gaatcaataa tataatccct taaaggatta tatctttgct 2160 
gggcgcagtg gctcacacct gtaatcccag cactttgggt ggccaaggtg gaaggatcaa 2220 
atttgcctac ttctatatta tcttctaaag cagaattcat ctctcttccc tcaatatgat 2280 
gatattgaca gggtttgccc tcactcacta gattgtgagc tcctgctcag ggcaggtagc 2340 ' 
gttttttgtt tttgtttttg tttttctttt ttgagacagg gtcttgctct gtcacccagg 2400 
ccagagtgca atggtacagt ctcagctcac tgcagcctca accgcctcgg ctcaaaccat 2460 
catcccattt cagcctcctg agtagctggg actacaggca c 2501 



<210> 59 

<211> 292 

<212> DNA 

<213> HUMAN 

<220> 
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<221> misc 
<222> (1) 



feature 



<400> 59 

tttttttgag acggagtctt gcatctgtca tgcccaggct ggagtagcag tggtgccatc 60 
tcggctcact gcaagctcca cctcccgagt tcacgccatt ttcctgcctc agcctcccga 120 
gtagctggga ctacaggcgc ccgccaccat gcccggctaa ttttttgtat ttttggtaga 180 
gacggggttt caccgtgtta gccagaatgg tctcgatctc ctgacttcgt gatccacccg 240 
cctcggcctc ccaaagttct gggattacag gtgtgagcca ccgcacctgg cc 292 



<210> 60 
<211> 262 
<212> DNA 
<213> HUMAN 

<400> 60 

ttcttttttg agacagggtc ttgctctgtc 
agctcactgc agcctcaacc gcctcggctc 
agctgggact acaggcacat gccattacac 
acagggtttg gccatgttgc ccgggctggt 
acctcagcct cccaaaatga gg 



acccaggcca gagtgcaatg gtacagtctc 60 
aaaccatcat cccatttcag cctcctgagt 120 
ctggctaatt tttttgtatt tctagtagag 180 
ctcgaactcc tggactcaag caatccaccc 240 

262 



<210> 61 

<211> 294 

<212> DNA 

<213> HUMAN 



<400> 61 

tttttttttg agacaaactt tcacttttgt tgcccaggct ggagtgcaat ggcgcgatct 60 
cggctcactg caacctccac ctcccgggtt caagtgattc tcctgcctca gcctctagcc 120 
aagtagctgc gattacaggc atgcgccacc acgcccggct aatttttgta tttttagtag 180 
agacggggtt tcgccatgtt ggtcaggctg gtctcgaact cctgatctca ggtgatccaa 240 
ccaccctggc ctcccaaagt gctgggatta taggcgtgag ccacagcgcc tggc 2 94 



<210> 62 

<211> 276 

<212> DNA 

<213> HUMAN 



<400> 62 

tgacagtctc tctgtcgccc aggctggagt gcagtggtgt gatcttgggt cactgcaact 60 
tccgcctccc gggttcaagg gattctcctg cctcagcttc ctgagtagct ggggttacag 120 
gtgtgtgcca ccatgcccag ctaatttttt tttgtatttt tagtagacag ggtttcacca 180 
tgttggtcag gctggtctca aactcctggc ctcaagtgat ccgcctgact cagcctacca 240 
aagtgctgat tacaagtgtg agccaccgtg cccagc 276 



<210> 63 
<211> 289 
<212> DNA 
<213> HUMAN 

<400> 63 

cgccgggcac ggtggctcac gcctgtaatc ccagcacttt gggaggccaa ggcaggtgga 60 
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tcacgaggtc aagagatcaa gaccatcctg gccaacatgg tgaaacccca tctctactaa 120 
aaatacgaaa aaatagccag gcgtggtggc gggtgcctgt aatcccagct actcgggagg 180 
ctgaggcagg agaatggcat gaacccggga ggcagaagtt gcagtgagcc gagatcgtgc 24 0 
cactgcactc cagcctgggc aacagagcga gactcttgtc tcaaaaaaa 289 



<210> 64 
<211> 298 
<212> DNA 
<213> HUMAN 

<400> 64 

aggctgggct ctgtggctta cgcctataat cccaccacgt tgggaggctg aggtgggaga 60 
atggtttgag cccaggagtt caagacaagg cggggcaaca tagtgtgacc ccatctctac 120 
caaaaaaacc ccaacaaaac caaaaatagc cgggcatggt ggtatgcggc ctagtcccag 180 
ctactcaagg aggctgaggt gggaagatcg cttgattcca ggagtttgag actgcagtga 24 0 
gctatgatcc caccactgcc taccatcttt aggatacatt tatttattta taaaagaa 298 



<210> 65 
<211> 105 
<212> DNA 
<213> HUMAN 

<400> 65 

ttttttacat ctttagtaga gacagggttt caccatattg gccaggctgc tctcaaactc 60 
ctgaccttgt gatccaccag cctcggcctc ccaaagtgct gggat 105 



<210> 66 
<211> 83 
<212> DNA 
<213> HUMAN 

<400> 66 

cctcgaactc ctaggctcag gcaatccttt caccttagct tctcaaagca ctgggactgt 60 
aggcatgagc cactgtgcct ggc 83 



<210> 67 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 67 

agaaggtaag t 11 



<210> 68 

<211> 11 

<212> DNA 

<213> HUMAN 

<400> 68 

tggaggtgag a 11 
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<210> 69 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 69 

cagtcgtgag g 11 



<210> 70 
<211> 11 
<212> DNA 
<213> HUMAN 



<210> 71 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 71 

tggaggtacc a 11 



<210> 72 
<211> 11 
<212> DNA 
<213> HUMAN 



<210> 73 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 73 

agcaggtggg c 11 

<210> 74 
<211> 11 
<212> DNA 
<213> HUMAN 



<400> 70 
ccgaggtgag c 



11 



<400> 72 
ggaaggtcag t 



11 



<400> 74 
gccaggtaca g 



11 



<210> 75 
<211> 11 
<212> DNA 
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<213> HUMAN 



<400> 75 
tgctggtgag t 



<210> 76 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 76 
cacaggtatt a 



<210> 77 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 77 
atacagggga t 



<210> 78 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 78 
ccccaggcga c 

<210> 79 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 79 
acgcagtgca a 

<210> 80 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 80 
tttcagatcc a 



<210> 81 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 81 



ccccaggagg g 



11 



<210> 82 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 82 

tcacaggctc a 11 



<210> 83 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 83 

ccctagctcc a 11 



<210> 84 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 84 

ctccagtcca g 11 



<210> 85 
<211> 12 
<212> DNA 
<213> HUMAN 

<400> 85 

tcgcaggtga ca 12 



<210> 86 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 86 

acacagaagg g 11 



<210> 87 
<211> 377 
<212> PRT 
<213> HUMAN 

<400> 87 

Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro Leu Gly Gly Gly Ser 
15 10 15 
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Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp Leu Pro Ser Glu Glu 
20 25 30 

Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu Pro Gly 
35 40 45 

Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Glu Val Lys Pro Lys 
50 55 60 

Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp Leu Pro Thr Val Glu 
65 70 75 80 

Ala Pro Gly Asp Pro Gin Glu Pro Gin Asn Asn Ala His Arg Asp Lys 
85 90 95 

Glu Gly Asp Asp Gin Ser His Trp Arg Tyr Gly Gly Asp Pro Pro Trp 
100 105 110 

Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe Gin Ser Pro Val Asp 
115 120 125 

lie Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala Leu Arg Pro Leu Glu 
130 135 140 

Leu Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu Leu Arg Leu Arg Asn 
145 150 155 160 

Asn Gly His Ser Val Gin Leu Thr Leu Pro Pro Gly Leu Glu Met Ala 
165 170 175 

Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin Leu His Leu His Trp 
180 185 190 

Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr Val Glu Gly His Arg 
195 200 205 

Phe Pro Ala Glu He His Val Val His Leu Ser Thr Ala Phe Ala Arg 
210 215 220 

Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu Ala Val Leu Ala Ala 
225 230 235 240 

Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gin Leu Leu 
245 250 255 

Ser Arg Leu Glu Glu He Ala Glu Glu Gly Ser Glu Thr Gin Val Pro 
260 265 270 

Gly Leu Asp He Ser Ala Leu Leu Pro Ser Asp Phe Ser Arg Tyr Phe 
275 280 285 

Gin Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys Ala Gin Gly Val He 
290 295 300 

Trp Thr Val Phe Asn Gin Thr Val Met Leu Ser Ala Lys Gin Leu His 
305 310 315 . 320 
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Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp Ser Arg Leu Gin Leu 
325 330 335 



Asn Phe Arg Ala Thr Gin Pro Leu Asn Gly Arg Val He Glu Ala Ser 
340 345 350 

Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala Ala Glu Pro Val Gin 
355 360 365 

Leu Asn Ser Cys Leu Ala Ala Gly Asp 
370 375 



<210> 88 
<211> 34 
<212> DNA 
<213> HUMAN 

<400> 88 

tagacagatc tacgatggct cccctgtgcc ccag 34 



<210> 89 
<211> 34 
<212> DNA 
<213> HUMAN 

<400> 89 

attcctctag acagttaccg gctccccctc agat 34 



<210> 90 
<211> 3532 
<212> DNA 
<213> HUMAN 

<220> 

<221> misc_feature which includes the MN gene promoter 
<222> (1) . . (3532) 

<223> region including the transcription initiation site (nucleotide 3507 of 
SEQ ID NO: 5 and of Figures 2A-2F) as determined by RNase protection assay, 
which region is inclusive of the MN gene promoter, and corresponds to 
nucleotide 7 to nucleotide 3538 of SEQ ID NO : 5 and of Figures 2A-2F. 

<220> 

<221> unsure what base is at position 1968 
<222> (1968) 

<223> unsure of the base at position 1968, which is the same unknown base at 
position 1974 of SEQ ID NO: 5 (the full-length MN genomic sequence), position 
1968 of SEQ ID NO: 58 and position 647 of SEQ ID NO: 110. That unknown base 
is in the region that includes the transcription initiation site (nucleotide 
3507 of SEQ ID NO: 5 and of Figures 2A-2F) as determined by RNase protection 
assay, which region is inclusive of the MN gene promoter. 

<400> 90 

tgttgactcg tgaccttacc cccaaccctg tgctctctga aacatgagct gtgtccactc 60 
agggttaaat ggattaaggg cggtgcaaga tgtgctttgt taaacagatg cttgaaggca 120 
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gcatgctcgt taagagtcat caccaatccc taatctcaag taatcaggga cacaaacact 180 
gcggaaggcc gcagggtcct ctgcctagga aaaccagaga cctttgttca cttgtttatc 24 0 
tgaccttccc tccactattg tccatgaccc tgccaaatcc ccctctgtga gaaacaccca 300 
agaattatca ataaaaaaat aaatttaaaa aaaaaataca aaaaaaaaaa aaaaaaaaaa 360 
aaaagactta cgaatagtta ttgataaatg aatagctatt ggtaaagcca agtaaatgat 420 
catattcaaa accagacggc catcatcaca gctcaagtct acctgatttg atctctttat 480 
cattgtcatt ctttggattc actagattag tcatcatcct caaaattctc ccccaagttc 540 
taattacgtt ccaaacattt aggggttaca tgaagcttga acctactacc ttctttgctt 600 
ttgagccatg agttgtagga atgatgagtt tacaccttac atgctgggga ttaatttaaa 660 
ctttacctct aagtcagttg ggtagccttt ggcttatttt tgtagctaat tttgtagtta 720 
atggatgcac tgtgaatctt gctatgatag ttttcctcca cactttgcca ctaggggtag 780 
gtaggtactc agttttcagt aattgcttac ctaagaccct aagccctatt tctcttgtac 840 
tggcctttat ctgtaatatg ggcatattta atacaatata atttttggag tttttttgtt 900 
tgtttgtttg tttgtttttt tgagacggag tcttgcatct gtcatgccca ggctggagta 960 
gcagtggtgc catctcggct cactgcaagc tccacctccc gagttcacgc cattttcctg 1020 
cctcagcctc ccgagtagct gggactacag gcgcccgcca ccatgcccgg ctaatttttt 1080 
gtatttttgg tagagacggg gtttcaccgt gttagccaga atggtctcga tctcctgact 1140 
tcgtgatcca cccgcctcgg cctcccaaag ttctgggatt acaggtgtga gccaccgcac 1200 
ctggccaatt ttttgagtct tttaaagtaa aaatatgtct tgtaagctgg taactatggt 1260 
acatttcctt ttattaatgt ggtgctgacg gtcatatagg ttcttttgag tttggcatgc 1320 
atatgctact ttttgcagtc ctttcattac atttttctct cttcatttga agagcatgtt 1380 
atatctttta gcttcacttg gcttaaaagg ttctctcatt agcctaacac agtgtcattg 1440 
ttggtaccac ttggatcata agtggaaaaa cagtcaagaa attgcacagt aatacttgtt 1500 
tgtaagaggg atgattcagg tgaatctgac actaagaaac tcccctacct gaggtctgag 1560 
attcctctga cattgctgta tataggcttt tcctttgaca gcctgtgact gcggactatt 1620 
tttcttaagc aagatatgct aaagttttgt gagccttttt ccagagagag gtctcatatc 1680 
tgcatcaagt gagaacatat aatgtctgca tgtttccata tttcaggaat gtttgcttgt 1740 
gttttatgct tttatataga cagggaaact tgttcctcag tgacccaaaa gaggtgggaa 1800 
ttgttattgg atatcatcat tggcccacgc tttctgacct tggaaacaat taagggttca 1860 
taatctcaat tctgtcagaa ttggtacaag aaatagctgc tatgtttctt gacattccac 1920 
ttggtaggaa ataagaatgt gaaactcttc agttggtgtg tgtccctngt ttttttgcaa 1980 
tttccttctt actgtgttaa aaaaaagtat gatcttgctc tgagaggtga ggcattctta 2040 
atcatgatct ttaaagatca ataatataat cctttcaagg attatgtctt tattataata 2100 
aagataattt gtctttaaca gaatcaataa tataatccct taaaggatta tatctttgct 2160 
gggcgcagtg gctcacacct gtaatcccag cactttgggt ggccaaggtg gaaggatcaa 2220 
atttgcctac ttctatatta tcttctaaag cagaattcat ctctcttccc tcaatatgat 2280 
gatattgaca gggtttgccc tcactcacta gattgtgagc tcctgctcag ggcaggtagc 2340 
gttttttgtt tttgtttttg tttttctttt ttgagacagg gtcttgctct gtcacccagg 2400 
ccagagtgca atggtacagt ctcagctcac tgcagcctca accgcctcgg ctcaaaccat 2460 
catcccattt cagcctcctg agtagctggg actacaggca catgccatta cacctggcta 2520 
atttttttgt atttctagta gagacagggt ttggccatgt tgcccgggct ggtctcgaac 2580 
tcctggactc aagcaatcca cccacctcag cctcccaaaa tgagggaccg tgtcttattc 2640 
atttccatgt ccctagtcca tagcccagtg ctggacctat ggtagtacta aataaatatt 2700 
tgttgaatgc aatagtaaat agcatttcag ggagcaagaa ctagattaac aaaggtggta 2760 
aaaggtttgg agaaaaaaat aatagtttaa tttggctaga gtatgaggga gagtagtagg 2 82 0 
agacaagatg gaaaggtctc ttgggcaagg ttttgaagga agttggaagt cagaagtaca 2 8 80 
caatgtgcat atcgtggcag gcagtgggga gccaatgaag gcttttgagc aggagagtaa 2 94 0 
tgtgttgaaa aataaatata ggttaaacct atcagagccc ctctgacaca tacacttgct 3000 
tttcattcaa gctcaagttt gtctcccaca tacccattac ttaactcacc ctcgggctcc 3060 
cctagcagcc tgccctacct ctttacctgc ttcctggtgg agtcagggat gtatacatga 3120 
gctgctttcc ctctcagcca gaggacatgg ggggccccag ctcccctgcc tttccccttc 3180 
tgtgcctgga gctgggaagc aggccagggt tagctgaggc tggctggcaa gcagctgggt 3240 
ggtgccaggg agagcctgca tagtgccagg tggtgccttg ggttccaagc tagtccatgg 3300 
ccccgataac cttctgcctg tgcacacacc tgcccctcac tccaccccca tcctagcttt 3360 
ggtatggggg agagggcaca gggccagaca aacctgtgag actttggctc catctctgca 3420 
aaagggcgct ctgtgagtca gcctgctccc ctccaggctt gctcctcccc cacccagctc 3480 
tcgtttccaa tgcacgtaca gcccgtacac accgtgtgct gggacacccc ac 3532 
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<210> 91 
<211> 204 
<212> DNA 
<213> HUMAN 



<400> 91 

cctgcccctc actccacccc 
caaacctgtg agactttggc 
ccctccaggc ttgctcctcc 
acaccgtgtg ctgggacacc 



catcctagct ttggtatggg 
tccatctctg caaaagggcg 
cccacccagc tctcgtttcc 
ccac 



ggagagggca cagggccaga 60 
ctctgtgagt cagcctgctc 120 
aatgcacgta cagcccgtac 180 

204 



<210> 92 
<211> 132 
<212> DNA 
<213> HUMAN 



<400> 92 

ggatcctgtt gactcgtgac cttaccccca 
ccactcaggg ttaaatggat taagggcggt 
aaggcagcat gc 



accctgtgct ctctgaaaca tgagctgtgt 60 
gcaagatgtg ctttgttaaa cagatgcttg 120 

132 



<210> 93 
<211> 275 
<212> DNA 
<213> HUMAN 



<400> 93 

gcatagtgcc aggtggtgcc ttgggttcca 
ctgtgcacac acctgcccct cactccaccc 
acagggccag acaaacctgt gagactttgg 
tcagcctgct cccctccagg cttgctcctc 
acagcccgta cacaccgtgt gctgggacac 



agctagtcca tggccccgat aaccttctgc 60 
ccatcctagc tttggtatgg gggagagggc 12 0 
ctccatctct gcaaaagggc gctctgtgag 180 
ccccacccag ctctcgtttc caatgcacgt 240 
cccac 275 



<210> 94 
<211> 89 
<212> DNA 
<213> HUMAN 

<400> 94 

ctgctcccct ccaggcttgc tcctccccca cccagctctc gtttccaatg cacgtacagc 60 
ccgtacacac cgtgtgctgg gacacccca 89 



<210> 95 
<211> 61 
<212> DNA 
<213> HUMAN 

<400> 95 

cacccagctc tcgtttccaa tgcacgtaca gcccgtacac accgtgtgct gggacacccc 60 
a 61 



34 



<210> 96 
<211> 116 
<212> DNA 
<213> HUMAN 

<400> 96 

acctgcccct cactccaccc ccatcctagc tttggtatgg gggagagggc acagggccag 
acaaacctgt gagactttgg ctccatctct gcaaaagggc gctctgtgag tcagcc 



<210> 97 
<211> 36 
<212> PRT 
<213> HUMAN 

<400> 97 

Gly Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp 
15 10 15 

Pro Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu 
20 25 30 

Glu Asp Leu Pro 
35 



<210> 98 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 98 

Gly Glu Glu Asp Leu Pro 
1 5 



<210> 99 
<211> 4 
<212> PRT 
<213> HUMAN 

<400> 99 
Glu Glu Asp Leu 
1 



<210> 100 
<211> 5 
<212> PRT 
<213> HUMAN 

<400> 100 

Glu Glu Asp Leu Pro 
1 5 
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<210> 101 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 101 

Glu Asp Leu Pro Ser Glu 
1 5 



<210> 102 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 102 

Glu Glu Asp Leu Pro Ser Glu 
1 5 



<210> 103 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 103 

Asp Leu Pro Gly Glu Glu 
1 5 



<210> 104 
<211> 22 
<212> PRT 
<213> HUMAN 

<400> 104 

Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp Leu Pro 
15 10 15 

Ser Glu Glu Asp Ser Pro 
20 



<210> 105 
<211> 25 
<212> PRT 
<213> HUMAN 

<400> 105 

Gly Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp 
15 10 15 

Pro Pro Gly Glu Glu Asp Leu Pro Gly 
20 25 



<210> 106 
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<211> 24 
<212> PRT 
<213> HUMAN 

<400> 106 

Glu Asp Pro Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
15 10 15 

Gly Glu Glu Asp Leu Pro Glu Val 
20 



<210> 107 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 107 

Gly Glu Thr Arg Ala Pro Leu 
1 5 



<210> 108 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 108 

Gly Glu Thr Arg Glu Pro Leu 
1 5 



<210> 109 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 109 

Gly Gin Thr Arg Ser Pro Leu 
1 5 



<210> 110 
<211> 1247 
<212> DNA 
<213> HUMAN 

<220> 

<221> misc_f eature 
<222> (1) . . (1247) 

<223> region 5' to the transcription initiation site as determined by RNase 
protection assay (nucleotide 3507 of SEQ ID NO: 5 and of Figures 2A-2F) in 
which an activating element is localized, which region corresponds to 
nucleotide 1328 to nucleotide 2574 of SEQ ID NO: 5 and of Figures 2A-2F. 

<220> 

<221> unsure what base is at position 647 
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<222> (647) 

<223> unsure of the base at position 647, which is the same unknown base as 
that at position 1974 of SEQ ID NO: 5, and as that at position 1968 of SEQ ID 
NOS: 58 and 90. That unknown base at position 647 is in a region in which an 
activating element is localized and is 5 1 to the transcription initiation site. 

<400> 110 

tatgctactt tttgcagtcc tttcattaca tttttctctc ttcatttgaa gagcatgtta 60 
tatcttttag cttcacttgg cttaaaaggt tctctcatta gcctaacaca gtgtcattgt 120 
tggtaccact tggatcataa gtggaaaaac agtcaagaaa ttgcacagta atacttgttt 180 
gtaagaggga tgattcaggt gaatctgaca ctaagaaact cccctacctg aggtctgaga 240 
ttcctctgac attgctgtat ataggctttt cctttgacag cctgtgactg cggactattt 300 
ttcttaagca agatatgcta aagttttgtg agcctttttc cagagagagg tctcatatct 360 
gcatcaagtg agaacatata atgtctgcat gtttccatat ttcaggaatg tttgcttgtg 420 
ttttatgctt ttatatagac agggaaactt gttcctcagt gacccaaaag aggtgggaat 480 
tgttattgga tatcatcatt ggcccacgct ttctgacctt ggaaacaat.t aagggttcat 540 
aatctcaatt ctgtcagaat tggtacaaga aatagctgct atgtttcttg acattccact 600 
tggtaggaaa taagaatgtg aaactcttca gttggtgtgt gtccctngtt tttttgcaat 660 
ttccttctta ctgtgttaaa aaaaagtatg atcttgctct gagaggtgag gcattcttaa 720 
tcatgatctt taaagatcaa taatataatc ctttcaagga ttatgtctt't attataataa 780 
agataatttg tctttaacag aatcaataat ataatccctt aaaggattat atctttgctg 840 
ggcgcagtgg ctcacacctg taatcccagc actttgggtg gccaaggtgg aaggatcaaa 900 
tttgcctact tctatattat cttctaaagc agaattcatc tctcttccct caatatgatg 960 
atattgacag ggtttgccct cactcactag attgtgagct cctgctcagg gcaggtagcg 1020 
ttttttgttt ttgtttttgt ttttcttttt tgagacaggg tcttgctctg tcacccaggc 1080 
cagagtgcaa tggtacagtc tcagctcact gcagcctcaa ccgcctcggc tcaaaccatc 1140 
atcccatttc agcctcctga gtagctggga ctacaggcac atgccattac acctggctaa 1200 
tttttttgta tttctagtag agacagggtt tggccatgtt gcccggg 1247 



<210> 111 
<211> 17 
<212> DNA 
<213> HUMAN 

<400> 111 

ctctgtgagt cagcctg 17 



<210> 112 
<211> 23 
<212> DNA 
<213> HUMAN 

<400> 112 

aggcttgctc ctcccccacc cag 23 



<210> 113 
<211> 18 
<212> DNA 
<213> HUMAN 

<400> 113 

agactttggc tccatctc 18 
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<210> 114 
<211> 20 
<212> DNA 
<213> HUMAN 

<400> 114 

cactccaccc ccatcctagc 20 



<210> 115 
<211> 26 
<212> DNA 
<213> HUMAN 

<400> 115 

gggagagggc acagggccag acaaac 2 6 



<210> 116 
<211> 20 
<212> PRT 
<213> HUMAN 

<400> 116 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
15 10 15 

Gly Gly Gly Ser 
20 
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